Evolutionary dynamics from a variational principle 



> 
O 



Peter Klimek^; Stefan Tliurner^'^'*; Rudolf Hanel^; 
^Complex Systems Research Group; Medical University of Vienna; Wdhringer Giirtel 18-20; A-1090; Austria; 
Santa Fe Institute; 1399 Hyde Park Road; Santa Fe; NM 87501; USJcl 

We demonstrate with a thought experiment that fitness-based population dynamical approaches 
to evolution are not able to make quantitative, falsifiable predictions about the long-term behavior 
of evolutionary systems. A key characteristic of evolutionary systems is the ongoing endogenous 
production of new species. These novel entities change the conditions for already existing species. 
Even Darwin's Demon, a hypothetical entity with exact knowledge of the abundance of all species 
and their fitness functions at a given time, could not pre-state the impact of these novelties on 
, established populations. We argue that fitness is always a posteriori knowledge - it measures but 

■ does not explain why a species has reproductive success or not. To overcome these conceptual 
' limitations, a variational principle is proposed in a spin-model-like setup of evolutionary systems. 
, We derive a functional which is minimized under the most general evolutionary formulation of a 

dynamical system, i.e. evolutionary trajectories causally emerge as a minimization of a functional. 
This functional allows the derivation of analytic solutions of the asymptotic diversity for stochastic 
) ^ evolutionary systems within a mean-field approximation. We test these approximations by numerical 

^— ( , simulations of the corresponding model and find good agreement in the position of phase transitions 

' in diversity curves. The model is further able to reproduce stylized facts of timeseries from several 

' man-made and natural evolutionary systems. Light will be thrown on how species and their fitness 

landscapes dynamically co-evolve. 

■ PACS numbers: 05.65.-|-b, 87.10.Ed, 87.23. Cc 
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O^! I. INTRODUCTION 

Evolutionary dynamics appear in a multitude of different contexts. Evolution basically describes how sets of 
^ ' elements, such as biological species, goods and services in an economy, groups of living beings, or chemical compounds, 
change over time. Examples are abundant in various areas. Chemical compounds react with other compounds to 
produce new chemicals. Integrated circuits performing specific computational tasks can be combined to create another 
circuit for a different computational task. Prey and predator may co-evolve by succinctly acquiring new traits and 
thereby develop into new species. In the following we will use species for elements in whatever context, chemicals, 
goods, biological species, etc. The removal or addition of a single species in an evolutionary system may have dramatic 
consequences. For example, in starfish removal experiments (e.g. Mukkaw Bay in Washington [l[) starfish are removed 
' from an eco-system with the consequences that mussel populations explode and drive out most other species, while the 
• • urchin population destroys coral reefs. In 1904 English physicist John Ambrose Fleming accidentally manufactured 
. ^ ' tti6 first vacuum tube which triggered a cascade of technological and economic co-evolutions and adaptations; in 2004 
. the semiconductor industry was a market of $ 213 billion and enabled the generation of approximately $ 1200 billion in 
' electronic system businesses and $ 5000 billion in turn in services which amounts to 10% of world GDP 0. Typically in 
5^ 1 evolutionary systems species are endogenously added or removed from a large system of mutually influencing species. 
Two species influence each other if the existence of one species has a positive or negative effect on the change of 
abundance of the other. The possibilities for interactions in evolutionary systems involve different natural, economic 
or social laws on a variety of time or length scales. The collective result of these ('microscopic') interactions between 
elements leads to ubiquitous well-known macro phenomena in evolutionary systems, such as punctuated equilibria, 
booms of diversification, breakdowns and crashes, or seemingly unpredictable responses to external perturbations. 
Maybe one of the most exciting questions in natural sciences today is to understand if evolutionary dynamics can be 
understood by a common underlying principle and - if yes - how such a principle might look like. Such a principle 
must be general enough to capture the multitude of different phenomena, and at the same time must be in a form 
which can be applied easily to specific problems. 

In the present understanding of evolution the concept oi fitness is of central importance. Usually the relative abun- 
dance of species (with respect to other species) is described by replicator equations (first-order differential equations) 
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and variants such as Lotka-Volterra equations [1, 0, @ . Their mutual influence is quantified by a rate specifying how 
the abundance of one species changes upon contact with another. In biology this rate is called Malthusian fitness, in 
chemistry one refers to it as the reaction rate, in economics it is related to production functions. Similar proliferation 
rates could also be introduced for technological, financial or even historical contexts. In the following we subsume 
them all under the term fitness. 

A distance between two species can be defined as the minimal number of evolutionary steps needed for one species 
to evolve into the other one [oj (in biology this distance is often the number of single-point mutations two species 
differ in). In this way a metric is given on the space of all possible species. A fitness landscape assigns to each 
point in this space (that is to each species) its reproductive success or fitness. Evolution is sometimes pictured as an 
optimization problem where species evolve and co-evolve via adaptations toward peaks in this landscape These 
peaks represent regions of high reproductive success, whereas valleys correspond to low fitnesses. As these peaks and 
valleys are spread over larger regions of species space, the fitness landscape becomes less rugged. If one species moves 
toward a peak it may change the fitness of other species which are then moving on this fitness landscape too Q - 
one speaks of co-evolutionary cascades. The higher the degree of 'ruggedness' (i.e. the smaller the average distance 
between adjacent adaptive peaks or valleys), the higher the probability that these cascades of co-evolution last for 
long times. With lower ruggedness, however, the probability increases that each species reaches an adaptive peak and 
the evolutionary dynamics comes to a halt. 

The concept of fitness is limited however. To see this consider the following thought experiment. Suppose one -say 
a demon- would have exact knowledge about the abundance and fitness of each biological species in the universe. 
'Knowing the fitness of a species' means knowledge of the functional dependence of its proliferation rate on the entire 
current environment (i.e. all other species). The omniscient hypothetical entity in possession of this knowledge could 
be called Darwin's Demon for obvious reasons. The demon may be pictured as a super-biologist, able to measure 
each species' abundance as well as the dependence of its proliferation rate on each other species in each habitat. That 
is, he knows the set of all existing species and can measure their associated fitness landscape to an arbitrary degree 
of exactness. What can the demon predict about the future course of evolutionary events, such as biodiversity in 
100 million years or the time to the next mass extinction event? Surprisingly little, for the following reasons. A key 
characteristic of evolutionary systems is its potential to generate innovations, i.e. new species. In biological systems 
this can happen through mutations, in technological or economical ones through spontaneous ideas of an inventor, etc. 
Once a new species is created it becomes part of the environment and thereby potentially changes the conditions for 
all already existing species and for those yet to arrive. To now assess the fitness of a new species one has to measure 
how it spreads in an environment it is now part of. The demon has information related to a different environment, 
one which only existed before arrival of the new species. Thus the demon may have an exact description of the current 
biosphere, but with the advent of each new species this description loses accuracy. Fitness thus always encodes a 
posteriori knowledge, and can not be used to make falsifiable predictions. It is not fruitful to predict future fitness of 
species from their present fitness. Instead one has to understand how species and their fitness landscapes co-construct 
each other, how they co-evolve. 

To make headway in understanding the phenomenology of evolution, i.e. in identifying principles which guide 
evolutionary dynamics, a series of quantitative models have been suggested [1, |^, [l^, lU, [H, [3, Hi [Hi Here 
explicit assumptions are made about how new species come into being, how they interact with each other and under 
which conditions or under which selective forces they vanish. Each of these models focuses on particular aspects of 
evolution. For example in Kauffman NK models ^] species are bit-strings with randomly assigned fitness values. 
Arthur 9] focuses on technological evolution with integrated circuits as species whose fitness is examined by how well 
they execute certain computational tasks. Jain and Krishna [13] consider ecological systems and elucidate the interplay 
between interaction topology and survival of species. In most of these models some ad hoc assumptions about the 
mechanisms have to be made. In the model by Jain and Krishna, for example, species are actively removed and added 
to the system, innovations are externally enforced and not endogenously produced. In |9|] the output of randomly 
assembled circuits is compared to a prespecified list of desired computational tasks (such as bitwise addition). In 
the NK model evolutionary interactions are constrained to actions on bit-strings. Although these assumptions are 
certainly reasonable in the specific contexts of their models, it is not at all clear whether conclusions derived on the 
basis of these assumptions are universally valid in different contexts. 

To arrive at a general evolutionary description (without ad hoc specifications) one has to identify principles which 
are abstract enough to be applicable in each evolutionary context but which must be specific enough to make useful 
quantitative predictions. To meet these requirements, evolution can be pictured as a three-step production/destruction 
process. Step 1: New species come into being through recombination of already present species. That is, each species 
arises only under the condition that a given (and maybe not unique) set of other species or environmental factors 
exists. For example, to assemble an MP3-player all parts -including software- are needed. Sodium chloride can be 
produced by sodium hydroxide in solution with hydrochloric acid. With a substantial degree of oversimplification 
one can say that apes in combination steppe formation give rise to mankind. Step 2: The new species becomes part 
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of the system and can now be combined with other, already existing species. One can legally download music for 
the MP3-player and listen to it, sodium chloride reacts with e.g. calcium carbonate in the Solvay process, mankind 
burns forests to create fields for agriculture. Step 3: As a consequence, through this recombination yet new species 
may come into being and other already existing ones may vanish or be destroyed. For example, MPS currently 
drives CDs out of the market but can be combined with cell phones to give smartphones. Soda ash can be used to 
remove sulfur dioxide from flue gases in power stations which might help to reduce the ongoing Holocene extinction 
event of biological species [iTj possibly influenced by the advent of mankind. In previous work models incorporating 
these types of production and destruction processes have been shown to reproduce a wide range of evolutionary 
phenomena, including booms of diversification jTH, breakdowns of diversity [31 or punctuated equilibria [23|. Such 
processes further allow to understand stylized facts in time-series data on evolutionary systems, such as scale-free 
distributions of species lifetimes, the number of species per genus or the size of extinction events in fossil data [2l[, 
or GDP and business failures in economic markets [2^ . 

In this work we propose a variational principle from which dynamics - identical to the dynamics of the production 
/ destruction processes described above - can be derived. To this end we define the evolutionary potential of a species. 
This function measures in how many productions and destructions a species would (no longer) take part if it would 
enter (be removed from) the system. With this potential one obtains two formal representations of the system's 
dynamic: (i) The potential can be used to explicitly deduce a set of dynamical update equations of system diversity 
for production / destruction processes, (ii) Using this evolutionary potential and a measure for ongoing productions 
and destructions one can derive a balance function. The evolutionary process solving the dynamical update equations 
(i) always minimizes the balance function (ii). The balance function further allows asymptotic solutions for the system 
diversity (mean- field approximation). These analytic solutions are in good agreement with numerical simulations of 
the full model of productions and destructions. This is to a certain degree unexpected since the dynamics is dominated 
by strong and nonlinear interactions. 

This description of evolutionary systems allows to understand how the set of existing species and their fitness 
landscapes co-construct each other from first principles, as opposed to research strategies portrayed by Darwin's 
Demon, where snapshots of regions of fitness landscapes are empirically explored. Accordingly the focus shifts 
from predicting microscopic properties such as individual proliferation rates to estimating the occurrence of global, 
macroscopic events. 

This work is structured as follows. In section |TT] we develop a general framework for evolutionary systems via 
a variational principle. We discuss deterministic and stochastic implementations and obtain asymptotic diversity 
solutions in a mean-field approximation. In section Hill we motivate and define the choice of evolutionary interactions 
as production and destruction rules as in [2^ . We treat the special cases of systems with only productive interactions 
in section llVl and the pure destructive case in section|Vl Then we discuss the full model of productions and destructions 
in section IVIl We discuss empirical relevance of this work in section I VIII and turn to a conclude in IVIIII 



II. GENERAL FORMULATION OF DIVERSITY DYNAMICS 



A. Dynamical systems 

The abundance of species i is given by a binary state variable cFi{t) e {0, 1}. If species i exists at time t, ai{t) — 1, 
otherwise ai{t) = 0. The system can be populated by N species (N arbitrarily large, even infinite). A particular 
configuration of the system is characterized by the A^-dimensional vector in phase space (?(<) — {o'i{t)) G F = {0, 1}^. 
The system's diversity D{t) is given by D{t) = ai{t). 

At each time, species i may experience three scenarios, (i) annihilation cFiit) = 1 (Ti{t + 1) = 0, (ii) nothing 
ai{t) = ai{t+l) or (iii) creation ai{t) = — > ai{t+l) = 1. Suppose that there exists a function fi{(j{t)) : {0, 1}''^ R 
indicating which of the transitions (i)-(iii) takes place. Specifically, let fi{S{t)) indicate the following transitions 

(i) h{a{t))<Q ^a,{t + !) = {) 

(u) h{S{t)) = ^ a,{t + 1) = n,{t) (1) 
(iii) h{S{t))>Q ^a,{t + l) = l 

For (i) or (iii) a transition occurs if ai{t) = 1 or 0, respectively. That is, if fi{(j{t)) > the system evolves according 
to 

cr,(t+l) = (T,(i)H-A(T,(i) with Aa,:(t) = sgn[(l-cr,(i))/,(CT(i))]. (2) 

Aai{t) can only be non-zero if a-i{t) — and fi{<j{t)) > 0. Similarly, for fi{a{t)) < Aai{t) = sgn [—ai{t)fi{(T{t))]. 
Let us define the ramp function R(x) by R{x) = a; iff a; > and R{x) = iff a; < 0. Using these definitions we can 
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TABLE I: We exhaustively insert all possible values for ai{t) and (7^(t) in Eq.Q and Bi for non-zero values of fi{a{t)). For 
convenience we choose \fi{ff{t))\ = 1 and confine the threshold to < p < 2. We mark the values of (j'i{t) for which Bi is a 
minimum by an underscore. For these marked values o"i(t) is always equal to ai{t + 1). 
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jenerically map the indicator function fi from Eq.([T]) onto the update equation 
a,{t + l) = a,{t) + Aa,{t) , 



sgn 



(3) 



B. Variational principle for deterministic diversity dynamics 

We introduce a distance function to quantify the number of state changes in the system. Consider a virtual 
displacement of (Ji{t), <j[{t) = Ui{t) + 5(Ji{t). A quadratic distance function is given by 



K,{o[{t),a,{t)) 
where > 0^. Analogously a potential Vi is defined by 



^^[a[{t)-amf 



l~a[{t)^R{jMt))) -a',it)R(^-Mm?) 



(4) 



(5) 



which 'counts' the number of possible interactions for the displaced state o'^(t). Depending on cr^(i), Eq.® will 
reduce to Vi{(j'i{t),(7{t)) = \R{±fi{(T{t)))\. A possible intuition behind Eq.© is that fi acts as a 'field' on <Ji{t) which 
is 'probed' by cr[{t). We occasionally drop the cr-dependence for a clearer notation. Finally we define the balance 
function, Bi = Ki + Vi. 

Ki measures the actual activity in the system - it counts all state changes. The potential Vi counts the potential 
activity in the newly obtained states. Bi therefore contains the full dynamical information of Eq.ll} which can now 
be expressed through a variational principle. 



Given S{t), the solution ai(t + 1) of Eq.(^ is identical to the value of a[{t) for which Bi assumes its 
minimum, i.e. 



ai{t 



1) = argmin 



B,[al{t),a{t) 



(6) 



with argmin [/(a;)] denoting the value of x for which f{x) takes its minimum. 



This is proved by exhaustive insertion. First, consider the case fi{a{t)) — 0. From Eq.([2]) it follows that Aai{t) = 
and Vi — 0. The only possible term contributing to Bi is Ki; Ki — if cr^(t) = a-i{t) and Ki = ^ > otherwise. 
The balance function Bi takes its minimum, = at <J^(t) — ai(t) = (7i{t + 1). Similar reasoning can be applied to 
the cases of non-zero fi{<j{t)), see TablH This now clarifies the role of the parameter /i. It can be seen as an inertial 
threshold; the dynamics of Eq.Q only takes place if the 'field' fi{(j{t)) describing a certain state-change exceeds the 
barrier set by There is always a choice for /i such that Eq.fS]) holds. 



Note the similarity to kinetic energy in classical mechanics. 
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C. Stochastic diversity dynamics 

There exists a natural stochastic variant of diversity dynamics. In Eq.Q a state transition ai{t) ai{t + 1) is 
determined by Aai(t) e { — 1, 0, 1}. For the stochastic case we specify transition probabilities for this evolution. 

From the variational principle Eq.® it follows that Eq.Q always minimizes the balance function B^. In the 
stochastic variant we assume that the lower Bi, the higher is the probability to find the system in the respective 
configuration ai (t) . In analogy to spin systems this probability is a Boltzmann factor 

oce-''^'(^(*)) , (7) 

with P = 1/T the inverse temperature. To obtain transition probabilities we demand detailed balance 

p{ai{t) ai{t)) p{a,{t}) ' 

with Bi = Bi{ai{t), a{t)j^i). There are several ways to choose transition probability such that Eq.® is satisfied, here 
we use Metropolis transition probabilities p((Ti(i) — > (Ji(t)) = lifBi — Bi < and p {a i{t) ai{t)) = exp[—f3{Bi — Bi)] 
otherwise. The stochastic diversity dynamics is fully specified by setting^ (Ji(t + 1) = o'i(t). 

Whereas in the deterministic case the balance function Bi is minimized, the stochastic diversity dynamics shows 
'disordering effects' due to non-zero temperature T as given in Eq.(I7]). We quantify this with Boltzmann-Gibbs 
entropy. 



D. Mean- field approximation 



Denote the expectation value of ai{t) by qi{t) — {(Ji[t)) and assume that the probability distribution factorizes, 
i.e. p{alt)) = WiPiio'iit)) with pi{ai(t)) = (1 — qi{t))5„.(f)fi + 1i{t)^iJi{t).i- In this mean-field approximation the 
Boltzmann-Gibbs entropy s for species i is 



s{a,{t)) = -{lnp,{a^(t)))=s{q,{t)) , 

s{q,{t)) = - (^l - q,{t)^ ln{l -q,{t))-q,{t) In q,{t) . 

The 'free energy' functional (j){ai{t)) for the system turns for this approximation into 



(9) 



(10) 



The asymptotic state of species i, qi{t — > oo) = qi, is identified by a minimum in free energy. The necessary condition 
for this, d^iq,)/dq, = 0, is ^ + ^In = 0, and 



|tanh 


[ f3d{B,)l 






2 dqr _ 





(11) 



The self-consistent solution of Ea. (fTT|) yields the asymptotic configuration. 



III. GENERAL FORMULATION OF EVOLUTIONARY INTERACTIONS 

Traditionally in the master equations framework"^ interactions are classified by transfer rates for abundances of 
species. The transfer rates measure how the change in abundance of a given species i is related to the abundance of 
other species ji, j2, . . . . Depending on how i and ji, j2, ■ • ■ are chosen, one obtains different systems of differential 



^ One can also define dynamics 'backwards' by the transition probability p{!ji{t) — > cFi{t)) which can be interpreted as inferring a{t) from 

the knowledge of -1-1). 
^ as is typical for traditional evolutionary biology. 
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TABLE II; Summary of the traditional evolutionary interaction mechanisms: replication, competition, mutation and recom- 
bination. We indicate how the constructive/destructive set j has to be specified in Eq. (|12|l in order to recover the various 
mechanisms in our model. 

mechanism power set notation 



replication Xi —V 2xi j= {i} 

competition Xt+Xj — » Xi j= j} 

mutation Xj — > Xi j= {j} 

recombination Xj + Xh xi j= {j, k) 



equations which can be related to a specific form of evolutionary interactions. If species i with abundance Xi replicates 

with rate /i, the interaction is of type replication and is represented as xi ^ 2xi, (replicator equation [3]). Competition 
is a mechanism where the replication rate of species i also depends on other species j through a transfer rate pij, 

This type of interactions is used in the game dynamical equation 23], which is a special case of 
the frequency dependent replicator equation [23| . The mechanism mutation assigns a mutation or transfer rate 

between two species according to Xi Xj , together with replication and competition we obtain the replicator- mutator 
equation [2^, of which the quasispecies equation [2^ is a special case. Replication can take place without replicators, 
species are then produced by recombination processes. In the case of three species i, j and k with a recombination 
rate atjk, this mechanism is Xj + Xk xi. The corresponding dynamical system is called catalytic network see e.g. 
[27l |. It is formally possible to express replication, mutation and competition as special cases of the recombination 
mechanism [28l [29j . In this sense recombination mechanisms provide a unifying description of the other evolutionary 
interactions above - an observation we use as a starting point for our model. 

In the general form of a recombination process an arbitrary number of species ji, j2, ■ ■ ■ , jn influences a given species 
i. We distinguish two types of interactions of this form, (i) constructive interactions or productions where species i 
benefits from species ji, ■ ■ ■ ^jn and (ii) destructive interactions or destructions where the j's are causing harm to 
i. In the master equation framework constructive interactions correspond to positive transfer rates, destructions to 
negative ones. We denote the set of species ji, j2, ■ ■ ■ , jn = j- If the set of all species is A/", j is an element of the set 
of all subsets of A/", i.e. j is an element of the power set of A/", V{N). A recombination always maps an element from 
'P{N) to an element from M via a transfer rate a^j, i.e. by a map a : V^M) M. From now on italic indices refer 
to elements of N, e.g. i G M, while bold-face indices refer to elements of the power set, j G V{M). Transfer rates are 
represented by their sign. For convenience define binary state variables for sets of nodes, let CTj(i) — Higj The 
most general form of evolutionary interactions can then be written as 

<J^-^<y^ ■ (12) 

We summarize in Tab|lT] how the evolutionary interaction mechanisms of replication, competition, mutation and 
recombination are contained in Eq. fT^ for special choices of j. If there is a constructive interaction between j and «, 
i.e. j is a constructive set, we capture it in the production rule table : V[M) M with a^j = 1, otherwise a^j = 0. 
Similarly, if the interaction in Ea. (|12p is destructive, i.e. j is a destructive set, we record this in the destruction rule 
table : 'P(A/') M with a^j = 1, otherwise — 0. At some points in this work we will assume that the rule 
tables a* are random tensors. In this case they are given by two parameters and r^. n"^ is the cardinality 
of constructive sets, |j| — n"*" in Eq. (|12p and for random each species i has on average the same number of 
constructive sets, (^j Similarly a" is given by n~ and r~. 



IV. CONSTRUCTIVE INTERACTIONS 

A. Constructive dynamical system 

We first consider a system with constructive interactions only. We read Ea. p2| as 'from Oj(t) = 1 follows that 
ai{t+\) — V . In a chemical setting the chemical compounds contained in j react to give compound i, in an economic 
setting the goods j can be assembled to produce good i, see FiglTJa). The constructive dynamical system characterized 
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(a) (b) (c) 

FIG. 1: A graphical representation of constructive, destructive and combined interactions, (a) The constructive set j (green 
area) contains three species (circles) j = {I, m, n}. They produce species i, i.e. af. = 1. The constructive set is active (indicated 
by the blue color of the circles) we have crj(t) ~ 1 and ai{t) — 0, by Ea. (|13|l ai{t + 1) — 1. (b) The destructive set j' (red striped 
area) of cardinality two is active since each of its contained species m',n' is active and interacting with species i' through 
a~, y — 1. Following Ea. (|18p . (Ji'{t) = 1 will be deactivated, ^^/(t + 1) = 0. (c) A pictorial description of a network with both 
constructive and destructive interactions at a point in time. 



by Eq. fTSl) is given by 



Aa,{t) = sgn I ( 1 - a,{t) ) ^ a+.a^it) | , (13) 



i.e. Ma{t)) from Eq.© becomes f+{ff{t)) = Eia^f^jW- 



B. Deterministic constructive diversity dynamics 



In the limiting case of T = i.e. (3 ^ oo the system detcrministicahy obeys the dynamics of Eq. p^ . The behavior 
of D{t — > oo) is well understood; this case is identical to the model studied in [H,[l^ for random interaction topologies 
given by n"*", r"*" here. D{oo) was computed as a function of , r"*" and D(0). It was shown that this system has a 
phase transitions formally equivalent to the phase transition of a van der Waals Gas. There exists a critical diversity 
of initial species Dc{0) above which the system is driven toward an almost fully populated state; below this threshold 
the dynamics freezes. All these findings are identical what we find here in the T = case. 



C. Stochastic constructive diversity dynamics 



We next turn to non-zero temperature T. The crucial feature distinguishing deterministic and stochastic diversity 
dynamics is the dependence on the initial conditions. In the presence of stochastic perturbations the final diversity is 
not a function of the initial diversity D{0). We employ a mean- field approach by assuming that the expectation value 
of a product equals the product of expectation values, {gi{a)g2{cr)) — {gi{a)){g2{o'))- The expectation value of the 
constructive potential {Vj^)pi^g(^t)) of species i is 

quantifying what could be produced given the actual configuration of the system^. 



* Note that the structure of has a strong similarity to the potential of the paradigmatic Ising model. Our model diverges in the 
following ways: (i) interactions are defined not between nodes but between constructive sets and nodes and (ii) interactions are not 
symmetric, the action of j on j does not equal the action of j on j. 
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As mentioned above, plays the role of a threshold. For < < 2 a species gets activated by one constructive 
set, for 2 < /J, < 4 at least two constructive sets are needed and etc. From now on we fix the threshold /i = 1. The 
contribution to free energy is K^{t) = ^{Aai{t)y . We can estimate the expectation value {Ki{t))p(^s(t)) by making 
use of the dynamical relation Eq. (jl3p . and get for the mean- field assumption 

{Kt),iSit)) = \[(l-q^{t)) E <i'Y{l^m ■ (15) 

Using this in Eq. (|lip gives us the mean-field solution for arbitrary interaction topologies . To compute it explicitly 
we assume random interaction topologies. The aim is to derive an expression for in the limit t ^ oo. Note 

that ^Q^^ — — TlijeViM) '^i^j rijej 1j- ^'^^ to the randomness in a+ the same average 'field' is exerted on each species. 
With q = (qi}i we get = — r+q"^. We apply the same reasoning to the distance-contribution (iir+)p(CT(t)) ■ By 

first carrying out the derivation and then putting in the assumptions about we get 

^ = -rV"-(l-9)(r+g""r , (16) 



and the self-consistent solution for the asymptotic abundance q, 



g = i <{ tanh 



|(r+g"^ + (l-g)(rV^)2) 



1 , (17) 



from which the diversity follows as D{t oo) — Nq. We compare predictions of Eq. pT)) with simulations results 
from a Metropolis algorithm. The latter was implemented in the following way: We constructed a random a+ and 
initialized the system with a random initial condition (t(0). After initialization the algorithm applies the following 
procedure to each species once within one timestep (random sequential update) : 

• Pick a species i randomly. 

• Calculate Bi = Ki + Vi, according to Eqs.Q and (O with cr^(t) — cri{t). 

• Calculate B^^ Ki + Vi with a'f {t) ^1- ai{t). 

• Calculate AB = B, (^a'^{t) = 1 - cr,(t)^ - B^ ^(T^(t) = cr,{t) 

• If AS < set a,{t + 1) = 1 - <T,{t). 

• If AB > set CT,,(i + 1) = 1 - a,{t) with probabihty e"'^'^^. 

We executed the algorithm for one particular realization of a+ for 10^ timesteps and averaged over this time-span 
after discarding transient behavior (typically about 50 iterations). We performed simulations for system sizes of 
TV = 10^ — 10^ without noticing size effects on the results. However, the time-to-converge depends on N. We show 
the degree of agreement of simulations and Eq. ([T7]) in Fig^a) . 

V. DESTRUCTIVE DYNAMICS 

A. Destructive dynamical systems 

Assume now that only destructive interactions take place, e.g. two chemicals catalyzing the consumption of another 
chemical species, or biological species gaining (in symbiosis) an evolutionary advantage over another species. Eg. p^ 
is now read as 'from (Jj{t) = 1 follows ai{t -I- 1) = 0', see FiglT^b). To formulate this as a dynamical system as in 
Eq.© set Ma{t)) ^ .fr{a{t)) = - Ej ".j'^j (i) to get 

Aa,(t)=sgn -a,(t) ^ «i:jf^j(i) ■ (18) 
V i&v{^f) J 



We discuss the deterministic (T = 0) and stochastic {T > 0) scenario. 
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(a) (b) (c) 




FIG. 2: Diversity as a function of inverse temperature /? for various dynamical systems obtained from a mean-field approach 
(MF, lines) and Metropolis simulations (symbols), (a) Constructive dynamics with r"*" = 2. (b) Destructive dynamics with 
r~ = 2. (c) Combined dynamics with — 2>, r~ = 1 and n~ — 2. 



B. Deterministic destructive diversity dynamics 

In the deterministic case the asymptotic diversity D{t — > oo) is a function of the initial diversity. Let us discuss the 
case of a completely random destructive rule table a~ . By denoting qit) = D{t)/N we can derive an update equation 



for q{t) following the same reasoning as in [18|, q{t + 1) = q{t) — A q{t) with A q{t) = r q{t) yq"" (t) — {t— 1) 

In the limit of sparse rule densities this leads to q(t — > oo) — q{0) — n^r^q"^ In contrast to constructive 
dynamics, destructive dynamics do not exhibit a phase transition. With more species being destroyed the number 
of deactivated destructive sets increases even faster, thus the process comes to a halt without reaching a strongly 
unpopulated state. Note that other kinds of destructive dynamics may exhibit phase transitions. For example, if one 
requires for each species to be abundant at least one productive set to be abundant, that is species become extinct 
once they are not actively produced, the removal of a small number of species may trigger a cascade of extinction 
events which erases the entire population p^ . 

1. Stochastic destructive diversity dynamics 

For a stochastic variant of the destructive dynamical system of Eq. we repeat the analysis of the constructive 
case. We start with the corresponding destructive potential and distance terms, 

jGP(A/') j£} 

2 



(i^rW(t)) = o U.W V ■ (19) 



We proceed with the derivation of the destructive balance function Bi and get 



dq 

and the self-consistent solution for the asymptotic abundance q, 



= r-q'' +q{r~q'' Y , (20) 



<?=i|tanh -^(r-g""+g(r-g"")2 



(21) 



We compare this prediction to results of a Metropolis simulation in Figl^l^b). As is seen in the — 1 case, the 
deviation between Eq. (j2ip and simulations increases with p. For higher n~ and r" the same extent of deviation occurs 
for a higher value of /3. The mean-field approximation starts to significantly differ from simulations once entropic 
effects become negligible and the system's evolution approaches the deterministic scenario, that is e"'^'^'^' ^ 1/N Vi 
(on average less than one random state flip per iteration). To approximate V~{t) = ai(t) a~j Hjej ^j^^) ^"^y 
time t we have to consider the species which have not been deactivated at i — 1 - the system possesses memory. This is 
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not captured in the mean- field approximation {V~ ) = r~q" where we assume the populated species to be randomly 
distributed over N possible species at each time t. In the destructive case the mean-field approach thus works best 
whenever the random fluctuations are large enough to 'smear out' this memory effect, otherwise the system is better 
approximated by the deterministic description. 



VI. COMBINED DYNAMICS 



A. Combined dynamical systems 

We now study the interplay of both constructive and destructive dvnamics [20L |2^ : the situation is sketched 
in Fig[TJc). Destructive interactions represent an implicit selection mechanism [22|. Each species may be targeted 
(influenced) by constructive and destructive interactions. Assume that each interaction has equal influence. If the 
constructive forces outweigh the destructive ones the species prefers to be active and vice versa. For some systems 
other choices of weighting could be more appropriate (e.g. assuming that one destructive interaction outweighs any 
number of constructive ones - 'it is easier to destroy than to build'). It is straight-forward to incorporate alternative 
weighting schemes in the present framework. 

To combine constructive and destructive interactions we add their indicator functions, 



and get for the dynamical equation 

Acri(t) = sgn 



1 - <j,{t) \R{U{a{t))) - a,{t)R{-U{a{t))) 



(22) 



(23) 



The purely destructive or constructive dynamical systems are recovered by setting a = 



B. Deterministic combined diversity dynamics 



To obtain an estimate for the asymptotic diversity, we again use an update equation and combine the finding for 
the constructive and destructive cases. If we denote the average in-(de)crements in the constructive (destructive) 
scenario by Ag+(t) ( Ag~(t)), we study the update equation q{t -|- 1) = q{t) + Aq~^{t) — Aq~{t). This equation is 
solved by using the same Ansatz as in [ll|, yielding q{t 00) — q{0) — n^r^q^^ +^ -I- n+r+(l — g)q." . 

C. Stochastic combined diversity dynamics 

Let us calculate {Bi) for the stochastic scenario. The expectation value of the distance contribution, {Ki)pi^g(^i)-^, is 
more involved now. Constructive (destructive) dynamics take place under the condition that fi{(T{t)) > 0(< 0). Start 
with an expression for the probability that fi{a{t)) is positive (negative), . Consider random interaction topologies 
specified by and rv^. Define p{k,r^) as the probability that there are exactly k active constructive interactions, 

that is p{k, r+) = {^^)q"^''{l — q"^Y^^''. Analogously, q{l, r~) is the probability that exactly I out of destructive 
interactions are active. Then 

p+ = ^p{k,r+) E li^r^) : 

k=l 1=0 
r~ min(^— l,r"'") 

p- = Y.q{l,r-) E ^^(^'^"') • (24) 

1=1 k=0 

The average distance follows as 



{Kt)p{a) = 0(1- <li)P + <liP' 



(25) 
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FIG. 3: We compare the distribution of systemic observables of evolutionary systems with those of the combined stochastic 
model for two different parameter settings: Simulation 1 with 13 = 15, — 5, = 2 and Simulation 2 with (3 = 15, = 8, 
r~ = 12, — 2. Each distribution has been normalized (sum over all data points equals one), (a) The percent change of 
GDP of the UK since 1950 is compared to the model, (b) The reaction rate distribution in the model and in the metabolic 
network of E. coli is shown, (c) Species lifetime distributions as found in fossil data are well reproduced with the model. 



and, abbreviating fi{a{t)) = f^, the potential is 

- \il -q^)Ri.f^)-q^Ri~f^)\ 

Taking the derivative with respect to qi the mean-field result is 



dqi 



-9" + r-q'^" - [(1 - q)p+ + qp-] {p+ - p') 



with the self-consistent solution for the asymptotic abundance q 

"/3 



q = — ■{ tanh 
2 ' 



L (r+g"^ - r-g"" + [(1 - q)p+ + qp-] (p+ - p')) 



(26) 



(27) 



(28) 



Again we compare the mean- field prediction to results of a Metropolis simulation of the full model in Fig[2I^c). 



VII. DISCUSSION ON EMPIRICAL RELEVANCE 



A. Economical setting 



We interpret the model in different evolutionary contexts and compare its behavior to measured data. In an 
economic setting one can identify the number of active interactions as a measure for the productive output of an 
economy - for example the GDP An interaction is defined to be active iff cr.i(t) = o'j(i) a^j — 1. We show in 
FigEl^a) a comparison between the actual distribution of percent increments of the GDP of the UK and the number 
of active productions from the combined stochastic model for two different parameter settings. In one setting (3 = 15, 
= 5, = 2 is used, the other has a denser interaction topology, f3 = 15, r+ = 8, r~ = 12, = 2. Both model 
and real-world GDP timeseries produce fat-tailed distributions, with power exponents in the range between -2 and 
-4. These features are also found in GDP timeseries of other countries and for a wide range of model parameters, see 
e.g. HI. 



B. Chemical setting 



Another possible interpretation of the combined stochastic system is a chemical reaction network. In this case 
chemical species j = {ji, j2, • ■ • } are producing or degrading chemical i. There are N(r^ + r^) reactions. A reaction 
rate is defined as the frequency with which a certain reaction is active and a reaction is active if af^ = 1, ai(t) = 1 
and <Tj(t) = 1. This is compared to reaction rates in the metabolic network of E. coli [s^l in Fig[3l^b). Distributions of 
reaction rates in both cases, model and living organism, are fat-tailed. Least-squares fits to model power-laws yield 
exponents in the range of —1 to —3, depending on parameters. This compares well to the value of —1 found for E. 
coli. 
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C. Biological evolution setting 

Translated into a macro-ecological setting, one can compare the distribution of lifetimes of species in the combined 
stochastic model (number of iterations a given species is unintermitted abundant) with the distribution of species 
lifetimes in fossil data [3l| in Fig[3]Jc). Again one finds power-laws in the model with exponents between —2 and —4, 
which matches well with the paleontologic data, which suggest slopes between —2 and —3. Note that there is a strong 
dependence on the values used for the fit. We work with an intermediate choice in Fig[3I^c). 



VIII. DISCUSSION 



We propose a general framework to systematically study a large class of dynamical evolutionary systems defined 
on an arbitrary large number of species. The trajectory of existence of each species is governed by a function 
incorporating information of the surroundings - the existence of other species. We show how to express the resulting 
system dynamics via a variational principle. We discuss deterministic and stochastic variants. For the latter we 
derive a closed expression for the asymptotic diversity of evolutionary systems within a mean-field approximation. 
We discuss the quality of this approximation with respect to Metropolis simulations of the full model. Although 
the model explicitly introduces strong correlations between species' abundances, the mean-field approximation for 
asymptotic diversities match the simulation data surprisingly well. The model can be seen as a generalization of 
several previous models, which are contained as special cases. The deterministic constructive case is identical to 
the random catalytic networks studied in [l3|. In the model of Sole and Manrubia ^l5| only linear interactions are 
allowed (i.e. |j| = 1 in Ea. (jl2p ) and new species are created not through endogenous recombinations, but by an 
explicit mutation mechanism. As discussed in [13], fi{d{t)) in our combined stochastic model plays the identical role 
as the randomly assigned fitness values in the Bak-Sneppen model 11]. To recover the N K -model Q as a special 
case associate each species with a bit-string. A random fitness value is then assigned to each species' bit-string, in 
some variants of the model also in dependence of a given number of bits of other species' strings. However, fitness in 
our framework is a topological property of the entire system plus the set of abundant species, whereas in TVif-models 
fitness is basically a mapping of random numbers to bit-strings. 

We find that the model of constructive and destructive interactions reproduces stylized facts of man-made 
(economies) and natural evolutionary systems (metabolic networks, macro-ecology) across different orders of magni- 
tude. We belief this adds empirical substance to our claim that we have identified a crucial and ubiquitous building 
block of evolutionary systems with recombinatory, non-linear interactions within a simple binary framework. The 
model systematically expands on the idea that the concept of fitness is an a posteriori concept. Fitness in the tradi- 
tional sense can of course be reconstructed for every timestep in our model. It is nothing but the co-evolving network 
of rates of the actually active (productive) processes at a given time, see ^2^ for more details. It becomes clear that 
fitness can not be used as concept with much predictive value, even if 'Darwin's Demon' knowing all mutual influences 
at a given time would exist. The proposed model is free of 'Darwin's Demon'. 

This work was supported in part by the Austrian Science Fund, FWF P19132. 
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